Language-independent Grapheme-phoneme Conversion and Word Stress Assignment as a Web Service

نویسندگان

  • Uwe D. Reichel
  • Thomas Kisler
چکیده

We introduce a new language-independent procedure for grapheme-phoneme conversion, syllabification, and word stress assignment. Grapheme-phoneme conversion and syllabification is carried out by means of fallback sequences of decision trees trained on varying context sizes. Word stress is determined within an analogy-based framework by means of a Bayes classifier. Evaluation results on six languages are presented. Furthermore, it is described, how the tool is implemented and to be accessed as a web service that is freely available for academic research.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Letter-to-Phoneme Conversion for a German Text-to-Speech System

This thesis deals with the conversion from letters to phonemes, syllabification and word stress assignment for a German text-to-speech system. In the first part of the thesis (chapter 5), several alternative approaches for morphological segmentation are analysed and the benefit of such a morphological preprocessing component is evaluated with respect to the grapheme-to-phoneme conversion algori...

متن کامل

PermA and Balloon: Tools for string alignment and text processing

Two online research tools are presented in this paper: PermA, a general-purpose string aligner which can for example be used for grapheme-to-phoneme and phonemeto-phoneme alignment, and Balloon, a text processing toolkit for German and English providing components for part-of-speech tagging, morphological analyses, and grapheme-to-phoneme conversion including syllabification and word-stress ass...

متن کامل

A Language - Independent , Data - OrientedArchitecture for Grapheme - to

We report on an implemented grapheme-to-phoneme conversion architecture. Given a set of examples (spelling words with their associated phonetic representation) in a language, a grapheme-to-phoneme conversion system is automatically produced for that language which takes as its input the spelling of words, and produces as its output the phonetic transcription according to the rules implicit in t...

متن کامل

Phonological Constraints and Morphological Preprocessing for Grapheme-to-Phoneme Conversion

Grapheme-to-phoneme conversion (g2p) is a core component of any text-to-speech system. We show that adding simple syllabification and stress assignment constraints, namely ‘one nucleus per syllable’ and ‘one main stress per word’, to a joint n-gram model for g2p conversion leads to a dramatic improvement in conversion accuracy. Secondly, we assessed morphological preprocessing for g2p conversio...

متن کامل

A language-independent, data-oriented architecture for grapheme-to-phoneme conversion

We report on an implemented grapheme to phoneme conversion architecture Given a set of examples spelling words with their associated phonetic represen tation in a language a grapheme to phoneme conversion system is automatically produced for that language which takes as its input the spelling of words and pro duces as its output the phonetic transcription according to the rules implicit in the ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2014